#policy optimization30/06/2025
DSRL: Steering Robot Policies via Latent-Space Reinforcement Learning for Real-World Adaptation
DSRL introduces a novel method to adapt diffusion-based robotic policies via latent-space reinforcement learning, significantly boosting real-world task performance without modifying base models.